Search CORE

439 research outputs found

Do echinoderm genomes measure up?

Author: Cameron R. Andrew
Gibbs Richard A.
Gordona Susan M.
Kudtarkar Paul
Worley Kim C.
Publication venue: 'Elsevier BV'
Publication date: 01/08/2015
Field of study

Echinoderm genome sequences are a corpus of useful information about a clade of animals that serve as research models in fields ranging from marine ecology to cell and developmental biology. Genomic information from echinoids has contributed to insights into the gene interactions that drive the developmental process at the molecular level. Such insights often rely heavily on genomic information and the kinds of questions that can be asked thus depend on the quality of the sequence information. Here we describe the history of echinoderm genomic sequence assembly and present details about the quality of the data obtained. All of the sequence information discussed here is posted on the echinoderm information web system, Echinobase.org

Caltech Authors

Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis

Author: Cameron R. Andrew
Davidson Eric H.
Gibbs Richard A.
Tu Qiang
Worley Kim C.
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/10/2012
Field of study

A comprehensive transcriptome analysis has been performed on protein-coding RNAs of Strongylocentrotus purpuratus, including 10 different embryonic stages, six feeding larval and metamorphosed juvenile stages, and six adult tissues. In this study, we pooled the transcriptomes from all of these sources and focused on the insights they provide for gene structure in the genome of this recently sequenced model system. The genome had initially been annotated by use of computational gene model prediction algorithms. A large fraction of these predicted genes were recovered in the transcriptome when the reads were mapped to the genome and appropriately filtered and analyzed. However, in a manually curated subset, we discovered that more than half the computational gene model predictions were imperfect, containing errors such as missing exons, prediction of nonexistent exons, erroneous intron/exon boundaries, fusion of adjacent genes, and prediction of multiple genes from single genes. The transcriptome data have been used to provide a systematic upgrade of the gene model predictions throughout the genome, very greatly improving the research usability of the genomic sequence. We have constructed new public databases that incorporate information from the transcriptome analyses. The transcript-based gene model data were used to define average structural parameters for S. purpuratus protein-coding genes. In addition, we constructed a custom sea urchin gene ontology, and assigned about 7000 different annotated transcripts to 24 functional classes. Strong correlations became evident between given functional ontology classes and structural properties, including gene size, exon number, and exon and intron size

Caltech Authors

The genome of Apis mellifera: dialog between linkage mapping and sequence assembly

Author: Cornuet Jean-Marie
Gibbs Richard A
Li Bingshan
Monnerot Monique
Mougel Florence
Solignac Michel
Vautrin Dominique
Weinstock George M
Worley Kim C
Zhang Lan
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Two independent genome projects for the honey bee, a microsatellite linkage map and a genome sequence assembly, have interactively produced an almost complete organization of the euchromatic genome

Loss of the Polyketide Synthase StlB Results in Stalk Cell Over production in Polysphondylium violaceum

Author: Gibbs Richard A.
Kawabe Yoshinori
Kin Koryu
Kuspa Adam
Muzny Donna
Narita Takaaki B.
Richards Stephen
Schaap Pauline
Strassmann Joan E.
Sucgang Richard
Worley Kim C.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/05/2020
Field of study

University of Dundee Online Publications

Global analysis of transcription start sites in the new ovine reference genome (Oar rambouillet v1.0)

Author: Archibald Alan
Caulton Alex
Clark Emily
Clark Richard
Clarke Shannon M.
Cockett Noelle E.
Gazova Ivet
Murdoch Brenda
Salavati Mazdak
Smith Timothy P.L.
Worley Kim C.
Publication venue: 'Frontiers Media SA'
Publication date: 23/10/2020
Field of study

Edinburgh Research Explorer

An improved ovine reference genome assembly to facilitate in depth functional annotation of the sheep genome

Author: Bickhart Derek M.
Clark Emily
Cockett Noelle E.
Davenport Kimberly M.
Heaton Michael P.
Murali Shwetha C.
Murdoch Brenda
Rosen Benjamin D.
Salavati Mazdak
Smith Timothy P.L.
Worley Kim C.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 04/02/2022
Field of study

BACKGROUND: The domestic sheep (Ovis aries) is an important agricultural species raised for meat, wool, and milk across the world. A high-quality reference genome for this species enhances the ability to discover genetic mechanisms influencing biological traits. Furthermore, a high-quality reference genome allows for precise functional annotation of gene regulatory elements. The rapid advances in genome assembly algorithms and emergence of sequencing technologies with increasingly long reads provide the opportunity for an improved de novo assembly of the sheep reference genome. FINDINGS: Short-read Illumina (55× coverage), long-read Pacific Biosciences (75× coverage), and Hi-C data from this ewe retrieved from public databases were combined with an additional 50× coverage of Oxford Nanopore data and assembled with canu v1.9. The assembled contigs were scaffolded using Hi-C data with Salsa v2.2, gaps filled with PBsuitev15.8.24, and polished with Nanopolish v0.12.5. After duplicate contig removal with PurgeDups v1.0.1, chromosomes were oriented and polished with 2 rounds of a pipeline that consisted of freebayes v1.3.1 to call variants, Merfin to validate them, and BCFtools to generate the consensus fasta. The ARS-UI_Ramb_v2.0 assembly is 2.63 Gb in length and has improved continuity (contig NG50 of 43.18 Mb), with a 19- and 38-fold decrease in the number of scaffolds compared with Oar_rambouillet_v1.0 and Oar_v4.0. ARS-UI_Ramb_v2.0 has greater per-base accuracy and fewer insertions and deletions identified from mapped RNA sequence than previous assemblies. CONCLUSIONS: The ARS-UI_Ramb_v2.0 assembly is a substantial improvement in contiguity that will optimize the functional annotation of the sheep genome and facilitate improved mapping accuracy of genetic variant and expression data for traits in sheep

PubMed Central

Edinburgh Research Explorer

Concatenation cDNA sequencing for transcriptome analysis

Author: Angela M. Garcia
Jia Qian Wu
Judith F. Margolin
Kim C. Worley
Preethi H. Gunaratne
Richard A. Gibbs
Steven Hulyk
Publication venue
Publication date: 01/01/2003
Field of study

Comptes Rendus Biologies (CRBIOL)

CR Biologies

Bos taurus genome assembly

Author: Durbin K James
Gibbs Richard A
Havlak Paul
Jiang Huaiyang
Kent Matthew Peter
Lien Sigbjørn
Liu Yue
Qin Xiang
Ren Yanru
Shen Yufeng
Sodeland Marte
Sodergren Erica
Song Xing-Zhi Henry
Weinstock George M
Worley Kim C
Zhang Lan
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background We present here the assembly of the bovine genome. The assembly method combines the BAC plus WGS local assembly used for the rat and sea urchin with the whole genome shotgun (WGS) only assembly used for many other animal genomes including the rhesus macaque. Results The assembly process consisted of multiple phases: First, BACs were assembled with BAC generated sequence, then subsequently in combination with the individual overlapping WGS reads. Different assembly parameters were tested to separately optimize the performance for each BAC assembly of the BAC and WGS reads. In parallel, a second assembly was produced using only the WGS sequences and a global whole genome assembly method. The two assemblies were combined to create a more complete genome representation that retained the high quality BAC-based local assembly information, but with gaps between BACs filled in with the WGS-only assembly. Finally, the entire assembly was placed on chromosomes using the available map information. Over 90% of the assembly is now placed on chromosomes. The estimated genome size is 2.87 Gb which represents a high degree of completeness, with 95% of the available EST sequences found in assembled contigs. The quality of the assembly was evaluated by comparison to 73 finished BACs, where the draft assembly covers between 92.5 and 100% (average 98.5%) of the finished BACs. The assembly contigs and scaffolds align linearly to the finished BACs, suggesting that misassemblies are rare. Genotyping and genetic mapping of 17,482 SNPs revealed that more than 99.2% were correctly positioned within the Btau_4.0 assembly, confirming the accuracy of the assembly. Conclusion The biological analysis of this bovine genome assembly is being published, and the sequence data is available to support future bovine research.</p

Crossref

Columbia University Academic Commons

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Digital Commons@Becker